
Dynamic hardware detection#71

Merged
divyashreepathihalli merged 11 commits into keras-team:main from divyashreepathihalli:simplify-hw-names
Mar 9, 2026
Conversation

@divyashreepathihalli
Collaborator

@divyashreepathihalli divyashreepathihalli commented Mar 5, 2026

This PR enhances the hardware parsing capabilities of keras-remote, allowing users to specify generic hardware requests (e.g., gpu-16, tpu-512) and automatically provisioning the most appropriate accelerator. It removes the strict requirement for users to know exact GKE hardware topologies, improving the overall developer experience while maintaining strict backward compatibility.

  • Dynamic GPU & TPU Fallback: Added logic to dynamically query the hardware registry for generic strings (e.g., matching gpu-N and tpu-N).
  • Generation-Aware Prioritization: Overhauled the fallback search to iterate through a predefined list of preferred hardware (_PREFERRED_GPUS and _PREFERRED_TPUS). This ensures that requests like tpu-512 automatically provision the newest available generation (e.g. v4 or v5p over v2), rather than falling back to deprecated hardware based on dictionary insertion order.
  • Canonical Accelerator Aliases: Added native regex alias support for names like v5e and ghostlite, mapping them cleanly back to the canonical v5litepod representation under the hood. This ensures backend Kubernetes node pools are labeled consistently (tpu-v5litepod-xxxx) instead of creating fragmented node pool topologies.
  • Topology & Regex Hardening: Cleaned up overlapping regex matches (e.g., _MULTI_GPU_RE matching TPU strings) to improve parsing efficiency and prevent edge-case false positives. All associated unit tests in accelerators_test.py have been updated to assert proper canonical alias resolution.
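The generation-aware fallback described above can be sketched roughly as follows. The _PREFERRED_TPUS and TPUS names come from this PR; the registry contents and the resolve_generic_tpu helper are illustrative assumptions, not the merged implementation.

```python
import re

# Hypothetical registry shape: the real TPUS specs in
# keras_remote/core/accelerators.py are richer; here each entry just
# maps a generation name to its supported chip counts.
TPUS = {
    "v2": {8, 32, 128, 256, 512},
    "v3": {8, 32, 128, 256, 512},
    "v4": {8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096},
    "v5litepod": {1, 4, 8, 16, 32, 64, 128, 256},
}

# Newest generations first, so generic requests land on modern hardware
# instead of whatever dictionary insertion order happens to yield.
_PREFERRED_TPUS = ["v5litepod", "v4", "v3", "v2"]

_GENERIC_TPU_RE = re.compile(r"^tpu-(\d+)$")


def resolve_generic_tpu(request: str) -> str:
    """Map a generic 'tpu-N' request to the newest generation supporting N chips."""
    m = _GENERIC_TPU_RE.match(request)
    if not m:
        raise ValueError(f"not a generic TPU request: {request!r}")
    chips = int(m.group(1))
    for gen in _PREFERRED_TPUS:
        if chips in TPUS[gen]:
            return f"{gen}-{chips}"
    raise ValueError(f"no TPU generation supports {chips} chips")
```

With this sketch, "tpu-512" resolves to v4 (v5litepod tops out at 256 chips here), matching the prioritization behavior the PR describes.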

@gemini-code-assist
Contributor

Summary of Changes

Hello, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly upgrades the hardware parsing and provisioning capabilities within keras-remote. It allows users to specify hardware requirements more abstractly, such as "gpu-4" or "tpu-512", and intelligently selects the most suitable accelerator, prioritizing newer generations. This change streamlines the developer experience by abstracting away complex GKE hardware topologies while maintaining backward compatibility and expanding the range of supported hardware configurations.

Highlights

  • Dynamic Hardware Provisioning: Introduced logic to automatically provision GPUs and TPUs based on generic requests like "gpu-16" or "tpu-512", removing the need for exact GKE topology knowledge.
  • Generation-Aware Prioritization: Implemented a preference-based fallback mechanism for TPUs and GPUs, ensuring that newer hardware generations are prioritized when generic requests are made.
  • Canonical Accelerator Aliases: Added support for aliases like "v5e" and "ghostlite" to map to canonical TPU names, ensuring consistent backend labeling.
  • Expanded Hardware Support: Updated GPU and TPU specifications to include new hardware types (P4, P100) and expanded available counts/topologies for existing ones (L4, A100, V2, V3, V4, V5litepod).
  • Improved Parsing Robustness: Refined regex patterns for accelerator parsing to prevent false positives and enhance efficiency, along with comprehensive unit test updates.


Changelog
  • keras_remote/core/accelerators.py
    • Expanded supported GPU counts for L4, A100, and A100-80GB.
    • Added P4 and P100 GPUs to the GPUS registry.
    • Extended TPU topology options for V2, V3, and V5litepod.
    • Introduced V4 TPU specifications with various topologies.
    • Defined _TPU_ALIASES for "v5e" and "ghostlite" to map to "v5litepod".
    • Added DEFAULT_GPU, DEFAULT_TPU, _PREFERRED_GPUS, and _PREFERRED_TPUS constants.
    • Modified regex patterns (_MULTI_GPU_RE, _TPU_CHIPS_RE, _TPU_TOPO_RE) to be more flexible and accurate.
    • Implemented new parsing logic in parse_accelerator for generic "cpu-N", "gpu", "tpu", "gpu-N", and "tpu-N" requests, including generation-aware fallback.
    • Updated parse_accelerator to correctly resolve TPU aliases for bare names, chip counts, and topology strings.
  • keras_remote/core/accelerators_test.py
    • Added a test case for parsing "l4-2" GPU.
    • Updated the invalid GPU count test from "l4x8" to "l4x16" to reflect new L4 counts.
    • Added a test for parsing "cpu-8".
    • Introduced new test cases for generic "gpu" and "tpu" requests.
    • Added tests for generic "gpu-N" and "tpu-N" requests, including dynamic fallback scenarios (e.g., "gpu-16" falling back to A100, "tpu-4096" falling back to V4).
    • Added tests to verify correct resolution of "v5e" and "ghostlite" TPU aliases.
    • Included tests for unsupported generic GPU and TPU counts to ensure proper error handling.
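The alias-resolution behavior those tests exercise can be sketched as below. The _TPU_ALIASES table is from this PR; the canonicalize_tpu_name helper is a hypothetical illustration of how aliases resolve for bare names, chip counts, and topology strings alike.

```python
# Alias table introduced in this PR (keras_remote/core/accelerators.py).
_TPU_ALIASES: dict[str, str] = {
    "v5e": "v5litepod",
    "ghostlite": "v5litepod",
}


def canonicalize_tpu_name(name: str) -> str:
    """Rewrite an aliased TPU string to its canonical form.

    Works for bare names ("v5e"), chip counts ("v5e-16"), and topology
    strings ("v5e-4x4"): only the generation prefix is rewritten.
    """
    head, sep, tail = name.partition("-")
    return _TPU_ALIASES.get(head, head) + sep + tail
```

This keeps node pool labels canonical (e.g. tpu-v5litepod-16) regardless of which alias the user typed, so pool topologies stay unfragmented.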
Activity
  • No specific activity (comments, reviews, progress updates) has been recorded for this pull request yet.


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces a significant improvement to hardware parsing by allowing dynamic and generic accelerator requests, which greatly enhances the user experience. The generation-aware fallback and support for canonical aliases are excellent additions. My review focuses on a couple of opportunities to refine the implementation by reducing redundancy and code duplication in the new parsing logic, which will improve long-term maintainability. The accompanying test updates are thorough and provide good coverage for the new features.

@divyashreepathihalli
Collaborator Author

/gemini review


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces dynamic hardware detection, which is a great enhancement for user experience, aligning well with the API design guidelines of minimizing cognitive load. The changes are extensive, adding support for generic requests like gpu-N and tpu-N, introducing aliases for TPU types, and expanding the hardware registry. The accompanying tests are thorough. I found a logic issue in the implementation of generation-aware prioritization. The search order for accelerators does not correctly prioritize the newest hardware generations as intended. My review includes suggestions to correct this.

@divyashreepathihalli
Collaborator Author

/gemini review


@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces a valuable enhancement for dynamic hardware detection, simplifying accelerator requests for users. The changes are well-structured and include comprehensive tests for the new functionality. My review focuses on two main points: improving the robustness of the parsing logic to prevent potential future conflicts between GPU and TPU naming, and increasing the precision of a unit test to better reflect the deterministic nature of the new hardware selection logic. Overall, this is a great improvement to the user experience.


_TPU_ALIASES: dict[str, str] = {
"v5e": "v5litepod",
"ghostlite": "v5litepod",
Member


I don't think *fish names are okay to use externally

Collaborator Author


removed

Collaborator


fyi: The fish names are exposed externally at https://cloud.google.com/skus/sku-groups/vertex-prediction.

Collaborator

@JyotinderSingh JyotinderSingh left a comment


Thanks for the PR! Left a few comments.



Comment on lines +170 to +171
_MULTI_GPU_RE = re.compile(r"^(.+?)(?:x|-)(\d+)$") # "a100x4", "l4-2"
_TPU_CHIPS_RE = re.compile(r"^([a-z0-9_]+)-(\d+)$") # "v3-8", "v5litepod-16"
Collaborator


_MULTI_GPU_RE = r"^(.+?)(?:x|-)(\d+)$" matches any name-number pattern, which is the same shape as _TPU_CHIPS_RE = r"^([a-z0-9_]+)-(\d+)$".

e.g., "l4-2" matches both regexes. The reason it works today is that the Multi-GPU check sits at the end of the function, so TPU parsing falls through first (since "l4" is not in TPUS).

If anyone reorders the checks in the future, TPU parsing will intercept GPU (with dash) strings or vice versa.

Do you think we should let it be for now, or should we use this opportunity to implement what was discussed offline to utilize tpu:.. and gpu:... prefixes for accelerator name, which also helps popularise the TPU branding.
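The overlap can be demonstrated directly. The two patterns below are quoted from the diff; the split_family helper is a hypothetical sketch of the gpu:/tpu: prefix idea discussed here, not code from this PR.

```python
import re
from typing import Optional

# The two patterns quoted above (keras_remote/core/accelerators.py).
_MULTI_GPU_RE = re.compile(r"^(.+?)(?:x|-)(\d+)$")   # "a100x4", "l4-2"
_TPU_CHIPS_RE = re.compile(r"^([a-z0-9_]+)-(\d+)$")  # "v3-8", "v5litepod-16"

# "l4-2" satisfies both patterns, so correctness currently depends on
# the order of checks inside parse_accelerator.
matches_both = bool(_MULTI_GPU_RE.match("l4-2")) and bool(
    _TPU_CHIPS_RE.match("l4-2")
)


def split_family(name: str) -> tuple[Optional[str], str]:
    """Peel an explicit 'gpu:'/'tpu:'/'cpu:' prefix off an accelerator name.

    With an explicit family prefix, the ambiguity never reaches the
    overlapping regexes, regardless of check ordering.
    """
    family, sep, rest = name.partition(":")
    if sep and family in ("gpu", "tpu", "cpu"):
        return family, rest
    return None, name
```

Unprefixed strings pass through unchanged, which is what keeps legacy names like "l4-2" or "v5litepod-16" working.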

Collaborator Author

@divyashreepathihalli divyashreepathihalli Mar 9, 2026


Great idea. I've updated the core parsing logic to fully support and encourage explicit gpu: and tpu: prefixes. However, I also retained fallback parsing so that legacy unprefixed strings (like v5litepod or l4) continue to work as before!

@JyotinderSingh
Collaborator

JyotinderSingh commented Mar 7, 2026

gcp.container.NodePoolNodeConfigGuestAcceleratorArgs(
type=gpu.gke_label,
count=1,
),

Just a note: we presently create all GPU node pools with count=1. I'll leave it to you whether you'd like to fix that as part of this change.

Presently, GpuSpec stores a single machine_type for all counts. We need to follow the TpuSpec model which maps each count to its own machine type.

We only need to map the int count to an str machine_type (instead of requiring a GpuTopology class), since only machine_type varies per count.

Something like:

from dataclasses import dataclass

@dataclass(frozen=True)
class GpuSpec:
  gke_label: str
  counts: dict[int, str]    # count -> machine_type

Then the GPUS registry will need to be updated as:

GPUS = {
  "a100": GpuSpec("nvidia-tesla-a100", {
    1: "a2-highgpu-1g",
    2: "a2-highgpu-2g",
    # ...
  }),
  "h100": GpuSpec("nvidia-h100-80gb", {
    1: "a3-highgpu-1g",
    # ...
  }),
# ...
}

Then we can update the guest_accelerators count in node pool creation:

guest_accelerators=[
  gcp.container.NodePoolNodeConfigGuestAcceleratorArgs(
    type=gpu.gke_label,
    count=gpu.count,
  ),
],

@divyashreepathihalli
Collaborator Author

Good catch! I have refactored GpuSpec to mirror TpuSpec's behavior. It now uses a counts: dict[int, str] dictionary that maps each explicit integer count directly to its optimal GKE machine type (e.g. mapping l4 count 8 to g2-standard-96). I also updated keras_remote/cli/infra/program.py so guest_accelerators dynamically injects count=gpu.count instead of hardcoding 1.
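A minimal runnable sketch of the refactored lookup, assuming the shapes described in this thread. The l4 count-8 to g2-standard-96 mapping is stated above; the other registry entries and the machine_type_for helper are illustrative assumptions, not the merged code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuSpec:
    """Mirrors TpuSpec: each supported GPU count maps to its own machine type."""
    gke_label: str
    counts: dict[int, str]  # count -> machine_type


# Illustrative subset; the real GPUS registry in accelerators.py is larger.
GPUS = {
    "l4": GpuSpec(
        "nvidia-l4",
        {1: "g2-standard-4", 2: "g2-standard-24", 8: "g2-standard-96"},
    ),
}


def machine_type_for(name: str, count: int) -> str:
    """Resolve the GKE machine type for a GPU name and explicit count."""
    spec = GPUS[name]
    try:
        return spec.counts[count]
    except KeyError:
        raise ValueError(f"unsupported count {count} for {name!r}") from None
```

Node pool creation can then pass count=gpu.count to NodePoolNodeConfigGuestAcceleratorArgs instead of a hardcoded 1.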

@divyashreepathihalli
Collaborator Author

FYI: The local e2e tests are passing.

@divyashreepathihalli divyashreepathihalli merged commit 8f3495a into keras-team:main Mar 9, 2026
4 checks passed
